The Condition Number of the PageRank Problem
نویسندگان
چکیده
We determine analytically the condition number of the PageRank problem. Specifically, we prove the following statement: “Let P be an n × n row-stochastic matrix whose diagonal elements Pii = 0. Let c be a real number such that 0 ≤ c < 1. Let E be the n × n rank-one row-stochastic matrix E = ev , where e is the n-vector whose elements are all ei = 1, and v is an n-vector that represents a probability distribution. Define the matrix A = [cP + (1 − c)E] . The problem Ax = x has condition number κ = (1 + c)/(1− c).” This statement has implications for the accuracy to which PageRank can be computed, currently and as the web scales. Furthermore, it provides a simple proof that, for values of c that are used by Google, small changes in the link structure of the web do not cause large changes in the PageRanks of pages of the web. 1 Theorem Theorem 1. Let P be an n× n row-stochastic matrix whose diagonal elements Pii = 0.. Let c be a real number such that 0 ≤ c ≤ 1. Let E be the n × n rank-one rowstochastic matrix E = ev , where e is the n-vector whose elements are all ei = 1, and v is an n-vector that represents a probability distribution1. Define the matrix A = [cP + (1− c)E] . The problem Ax = x has condition number κ = (1 + c)/(1− c). 2 Notation and Preliminaries P is an n× n row-stochastic matrix whose diagonal elements Pii = 0. E is the n× n rank-one row-stochastic matrix E = ev , where e is the n-vector whose elements are all ei = 1 and v is an n-vector whose elements are all non-negative and sum to 1. A is the n× n column-stochastic matrix: A = [cP + (1− c)E] (1) We let x be the dominant eigenvector of A. By convention, we choose eigenvectors x such that ||x||1 = 1. Since A is a non-negative matrix, the dominant eigenvector x is also non-negative. Therefore, ex = ||x||1 = 1 (2) Since A is column-stochastic, it’s dominant eigenvalue λ1 = 1, 1 ≥ |λ2| ≥ . . . ≥ |λn| ≥ 0. That is, Ax = x (3) 1 i.e., a vector whose elements are nonnegative and whose L1 norm is 1. 3 Proof of Theorem 1 We prove this case via a series of lemmas.
منابع مشابه
The Evaluation of the Team Performance of MLB Applying PageRank Algorithm
Background. There is a weakness that the win-loss ranking model in the MLB now is calculated based on the result of a win-loss game, so we assume that a ranking system considering the opponent’s team performance is necessary. Objectives. This study aims to suggest the PageRank algorithm to complement the problem with ranking calculated with winning ratio in calculating team ranking of US MLB. ...
متن کاملApplication of PageRank Model for Olympic Women’s Taekwondo Rankings: Comparison of PageRank and Accumulated Point Index System
Background. Although the World Taekwondo federation currently applies the APIS ranking method to calculate the Olympic rankings, some limitations exist. Objectives. This study applies the PageRank model to Olympics Taekwondo rankings. Methods. The 2015-2018 World Taekwondo Grand Prix competition results for women’s four weight classes (-49kg, -57kg, -67kg, +67kg) were used as research data, t...
متن کاملExtrapolation and minimization procedures for the PageRank vector
An important problem in Web search is to determine the importance of each page. This problem consists in computing, by the power method, the left principal eigenvector (the PageRank vector) of a matrix depending on a parameter c which has to be chosen close to 1. However, when c is close to 1, the problem is ill-conditioned, and the power method converges slowly. So, the idea developed in this ...
متن کاملA note on the PageRank algorithm q
In this paper we present some notes of the PageRank algorithm, including its L1 condition number and some observation of the numerical tests of two variant algorithms which are based on the extrapolation method. 2005 Elsevier Inc. All rights reserved.
متن کاملA note on the PageRank algorithm
In this paper we present some notes of the PageRank algorithm, including its L1 condition number and some observation of the numerical tests of two variant algorithms which are based on the extrapolation method. 2005 Elsevier Inc. All rights reserved.
متن کامل